.com 


Commerce (Business Statistics and Research Methods) 



9. Probability Distribution 


In a probability distribution, the variables are distributed according to some definite 
probability function. Various types of probability distributions are as follows: 

(I) Binomial Distribution 


The Binomial distribution is also known as "the outcome of Bernoulli process" and is 
associated with the name of "jacob Bernoulli." A Bernoulli process is a random process in 
which : 


1 (a) The process is performed under the same conditions for a fixed and I 
finite number of trials, say, n. I 

(b) Each trial is independent of other trials, i.e., the probability of an I 
outcome for any particular trial is not influenced by the outcomes of I 
the other trials. I 

(c) Each trial has two mutually exclusive possible outcomes, such as I 
"success"or "failure", "good"or " defective", "yes"or "no", "hit"or I 
"miss", and so on. The outcomes are usually called success and I 
failure for convenience. I 

(d) The probability of success, P, remains constant from trial to trial I 
(so is the probability of failure q, where, q=l-p). I 

The general model for specifying the probability of obtaining exactly x successes in a 
given number of n, Bernoulli trials is given by 


F 'ocus 
ormula 


f(x) = P[X = x]= n C x p x q n x for x = 0,1,2, . ,n. 



where p = theprobability of a successon a single Bernoulli trial 
n = the number of Bernoulli trials, 
x - thenumberof successes in n trials. 


This formula for the probability distribution of the number of successes in series of Bernoulli trials 
is called the Binomial probability distribution. It gives the probability of obtaining exactly x successes 
and (n - x) failures in n Bernoulli trials. 

The Binomial distribution satisfies the two essential properties of probability distribution, 


viz., 


(i) f(x) > 0; and 

(ii) £f(x) = 1. 

Mean and Variance of Binomial Distribution 


The mean of binomial random variable X, is the theoretical expected number of successes 
in n trials. It is denoted by p. 



Cocus 

lormula 

v J 

jU = E(X)= ±xf(x) = Y.x"C x p x q ~ x 

x=0 

l u = np 

Here n = number of trials 

p - probability of success in a single trial 
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The variance of the binomial random variable X measures the variation of the binomial 
distribution. It is given as follows: 



Note: Binomial distribution may be negatively or positively skewed. 

Example : The mean of a binomial distribution is 40 and standard deviation 6. Calculate 
n, p and q. 

Solution : The mean of binomial distribution is given by np and standard deviation by 

^npq. 


Since Jnpq. = 6; npq = 36 and np = 40 


Therefore, 


40q = 36 or q 
p = 1 - q = 1 


— = 0.9 
40 

0.9 = 0.1 


40 40 

Given,np = 40 or n = — = — = 400. 

r p 0.1 

Hence for the given question, n = 400, p = 0.1 and q = 0.9. 


Example : Suppose that the half of the population of town are consumers of rice. One 
hundred investigators are appointed to find out its truth. Each investigator interviewed 10 
individuals. How many investigators do you expect to report that three or less of the people 
interviewed are consumers of rice ? 

Solution : Probability of a person being consumer of rice is p = 1/ 2, q = 1/ 2. Probability 
that three people or less are consumers of rice is given by 

P[X < 3] = P[X = 0] + P[X = 1] + P[X = 2] + P[X = 3] 

= q 10 + 10 C X q 9 p 1 + 10 C 2 q 8 p 2 + 10 C 3 q 7 p 3 


UJ 


10 


+ 10 


(-) 

UJ 


10 


+ 45 


(-) 

UJ 


10 


+ 120 


(-) 

UJ 


10 


10 


nr i76 

(1 + 10 + 45 + 120) = —— 
v ' 1024 




Therefore, the number of investigators to report that three or less people are consumers 
of rice is given by 


176 

1024 


x 100 = 17.2 = 17 approx. 
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(II) Poisson Distribution 

The characteristics of the Poisson distribution are as follows : 

1. The occurrence of the events is independent. That is, the occurrence of an event 
in an interval of space or time has no effect on the probability of a second 
occurrence of the event in the same, or any other interval. 

2. Theoretically, an infinite number of occurrences of the event must be possible in 
the interval. 

3. The probability of single occurrence of the event in a given interval is proportional 
to the length of the interval. 

4. In any infinitesimal (extremely small) portion of interval, the probability of two 
or more occurrences of the event is negligible. 

Poisson distribution differs from the binomial distribution in two important aspects : 

(a) Rather than consisting of discrete trials, the distribution operates continuously 
over some given amount of time, distance, area, etc. 

(b) Rather than producing a sequence of successes and failures, the distribution 
produces successes, which occur at random points in the specified time, distance, 
area. These successes are commonly referred to as 'occurrences'. 

The Poisson distribution may be used to approximate binomial distribution when n is 
large and p is small and, therefore, is regarded as the limit of the binomial distribution. 

The Poisson distribution is given by 

p m tti x 

f(x) = P(X = x) =- —; x = 0,1, 2 ... 

where, m is called the parameter of the distribution and is the average number of 
occurrences of random event, x is the number of occurrences of the random event and e is the 
constant whose value is 2.7183. 

The Poisson distribution satisfies the two essential properties, i.e., f(x) > 0 and Xf(x) = 1. 



The following are some of the examples which may be analysed with the use of 

~w 

this distribution: 

Did You 
Know ? 

(a) the demandfor aproduct, 

(b) typographical errors occurring on thepagesofabook, 

(c) the occurrence of accident in a factory, 

(d) the arri val pa ttem i n a departm entals tore, 

(e) the occurrence of flaws in a bolt in a factory, and 




(f) thearrival of calls at a switch board. 
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Mean and Variance of Poisson Distribution 

The mean of the Poisson Distribution is given by 



jx = E(X)= Exf(x)= 'YuX 


e m 


T ' 

k/V' • 


ix = m = np 

Thevariance of the Poisson Distribution is given by: 
a 2 = E(X 2 ) - [E(X)] 2 = E(X 2 ) -m 2 
a 2 = m = np 

SoinaPoisson Distribution mean andvariance are equal. 


Note : Poisson distribution is positively skewed. 

Example : What probability model is appropriate to describe a situation where 100 
misprints are distributed randomly throughout the 100 pages of a book ? For this model, what 
is the probability that a page observed at random will contain at least three misprints ? 

Solution : Since there are 100 misprints in 100 pages, it implies that there is only one 
mistake on the average in a page. Therefore, the probability of being a misprint is very small 
as a page contains large number of words and n the number of words in 100 pages will be very 
large. So, in this case probability of being a misprint is small and n is very large, therefore, 
Poisson distribution is best suited here. 

Average or expected number of misprints in one page is 
m = np = 100 x 0.01 =1 
e~ m = e -1 = 0.3679 


Probability of at least three misprints in a page is 

= P[X > 3] = 1 - P(X < 3) = 1 - [P(X = 0) + P(X = 1) + P(X 


= 2 )] 


= 1 - 


-1 -1 ^ 

e +e + 


2! 


= 1 - e- 1 [2.5] = 1 - 0.3679 (2.5) 
= 1 - 0.9198 = 0.0802. 


Fitting a Poisson Distribution: The process of fitting a Poisson distribution is simple. 
We have to obtain the values of m and calculate the probability of zero occurrences. The other 
probabilities can be easily calculated by the recurrence relation as follows : 


f(x) = 


x 

e m 


f 


X! 


and f(x + l)- 


e“ m m x+1 


(x + l)! 


f(x + l) 

f(x) 


m 


x + l 
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or f(x + 1) = 
when x = 0, 

when x = 1, 


m 


when x = 2, 
and so on. 


-i f( x ) 
x +1 v ' 

f(l) = m f(0) 


m m 

f(2) = — f(l) = — f(0) 


m m 

f(3) = — f(2) = — f(0) 


This recurrence relation provides a very convenient method for fitting the Poisson 
distribution. The only probability we need to know is f(0) = e m , where m is the only parameter 
of the Poisson distribution. 

Multiplying by N (the total frequency) each probability of the Poisson distribution. We 
get the expected frequencies for respective probabilities. 

(Ill) Normal Distribution 

The Normal Distribution was discovered by De Moivre in 1733. It is an approximate to binomial 
distribution. Whether or not p is equal to q, the binomial distribution tends to the form of the 
continuous curve when n becomes large. 

A random variable X is said to have a normal distribution with parameters p (mean) and 
a 2 (variance) if the density function is given by : 



y =f(x)= 


i 


a 


y/2 


X-n 


— co < X < oo 


n 


where y = the computed height of an ordinate at a distance of X from the 
mean, 

<j = Standard deviation of the given normal distribution, 
n- the constant -3.1416; 42x- 2.5066, 

e= the constant = 2.7183 (thebase of the system of natural logarithm), 
p = Mean of the given normal distribution. 


If we draw the graph of normal distribution, the curve obtained will be known as 
normal curve and is given below. 



The graph of y = f(x) is a famous 'bell shaped' curve. The top of the bell is directly above 
the mean p. For large value of a, the curve tends to flatten out and for small values of a, it has 
a sharp peak. 
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The equation to a normal curve corresponding to a particular distribution is given by 


N 

y = /5= e 

GV27C 


lfx-V 

2v cr , 


The quantity 


N 




in the above formula is equal to the maximum ordinate of the 


normal curve which will always occur at the mean of the distribution. 

Relation between Binomial, Poisson and Normal Distribution 

When n is large and the probability p of the occurrence of an event is close to zero so 
that np remains a finite constant, then the binomial distribution tends to Poisson distribution. 

Normal distribution is a limiting form of Binomial distribution under the following 
conditions: 

(a) n, the number of trials is very large, i.e. n —» oo, and 

(b) neither p nor q is very small. 


Properties of Normal Distribution 

1. The normal curve is symmetrical about the mean (skewness = 0). 

2. The height of the normal curve is at its maximum at the mean. 

3. There is one maximum point of the normal curve which occurs at the 
mean. The height of the curve declines as we go in either direction 
from the mean. 

4. The normal curve is unimodal, i.e., it has only one mode. 

6. The variable distributed according to the normal curve is a 
continuous one. 

7. First and third quartiles are equidistant from the median. 

8. Mean deviation about mean = 4/5a or, more precisely, 0.7979 times 
of standard deviation. 

9. Mean ± c, mean ± 2c, and mean ± 3c covers 68.27%, 95.45%, and 
99.73% area respectively. 


Example : How many workers have a salary between Rs. 4000 and Rs. 6500, if the 
arithmetic mean is Rs. 5000. Standard deviation is Rs. 1000 and number of worker is 15,000, if 
the salary of the worker is assumed to follow the normal law ? 

Solution : 
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4000 - 5000 
1000 


(left of the mean) 


6500 - 5000 „ r 

z, =-= 1.5 

2 1000 


(right of the mean) 


From the table, we find that 34.13% of workers fall between Rs. 4000 and Rs. 5000 and 
43.32% fall between Rs. 5000 and Rs. 6500. 

34.13 + 43.32 = 77.45% of workers have a salary between Rs. 4000 and Rs. 6500. 
Number of workers getting a salary Rs. 4000 and Rs. 6500 is given by 
0.7745 x 15,000 = 11,618 


(IV) Exponential Distribution 

A continuous random variable x is said to have an exponential distribution with parameter 
y if its probability density function is given by 

jp(x) = ke~ Xx If0<x<oo, ?t>0 

I =0, otherwise. 

The exponential variable takes values over an infinite range. 


It can be proved that: 

1. The exponential density function decreases in the range 0 too, the 
maximum ordinate of the curve occurring atx = 0 (the value being 
itself), 

2. Larger the value ofk steeper is the decline in the ordinate, even for 
small values ofx. 

3. Smaller the value ofk, flatter does the curve become and lies closer to 
X-axis. 

The mean and variance of the exponential distribution are given by 

Mean = — 
k 

Variance - — 

k 2 


Ques. Which of the following relating to normal distribution are not correct ? 

(i) Co-efficient of skewness is three. 

(ii) It is mesokurtic. 



Mean deviation for it is 
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Ans. 


(iv) 

ju ± 2 <t covers 95.45 % area. 


(v) 

Mean, median and mode are equal. 


(vi) 

The standard normal variate z has 

mean 

Codes 

• 

• 


(A) 

(i), (iii) and (vi) 

(B) 

(C) 

(i), (iii) and (v) 

(D) 

(A) 

• Co-efficient of skewness is three. 


one and SD zero. 

(NTA UGC-NET June 2015 P-II) 

(Hi), (iv) and (v) 

(i), (Hi) and (iv) 


2 

• Mean deviation for it is —°- 


• The standard normal variate z has mean one and SD zero. 
Given above options are incorrect relating to normal distribution. 
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